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Abstract 

We critically examine uncertainties in parton distribution functions (PDFs) at large x arising 
from nuclear effects in deuterium F2 structure function data. Within a global PDF analysis, we 
assess the impact on the PDFs from uncertainties in the deuteron wave function at short distances 
■ and nucleon off-shell effects, from the use of relativistic kinematics, and from the use of a less 

, restrictive parametrization of the d/u ratio. We find in particular that the d-quark and gluon PDFs 

vary significantly with the choice of nuclear model. We highlight the impact of these uncertainties 
on the determination of the neutron structure functions, and on W boson production and parton 
luminosity at the Tevatron and the LHC. Finally, we discuss prospects for new measurements 
sensitive to the d-quark and gluon distributions but insensitive to nuclear corrections. 
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I. INTRODUCTION 



Parton distribution functions (PDFs) are traditionally determined via global fits to a wide 
variety of data for large momentum transfer processes [1-7] . Typical data sets include deep 
inelastic scattering (DIS) of charged leptons on proton and deuterium targets or neutrinos 
on heavy nuclear targets, lepton pair production on proton and deuterium targets, and the 
production of photons, vector bosons, or jets at large values of transverse momentum. The 
use of deuterium targets in charged lepton DIS and heavy nuclear targets for neutrino DIS 
brings in the added complication of having to account for the modifications of the PDFs in 
a nuclear environment compared to the PDFs of free nucleons. 

One of the key goals of such global fits is the determination of the flavor dependence 
of the extracted PDFs, i.e., determining the PDFs for each parton flavor separately. The 
classic method for disentangling the u and d PDFs is to compare charged lepton DIS on 
proton and deuterium targets. In order to concentrate on the extraction of the leading 
twist PDFs, cuts specifying minimum values of and are usually employed, thereby 
reducing the contributions of higher twist terms and target mass corrections. The effect of 
these cuts is to restrict the range in x to the region x < 0.7. In this region it is usually 
assumed that deuterium nuclear corrections are a few per cent or less and are, therefore, 
negligible. Accordingly, most global fits ignore deuterium nuclear corrections. 

In our previous analysis [1] the effects of systematically reducing the cuts on and 
were studied. This opened up the possibility of including data at larger values of x than 
those typically used in global analyses. Target mass effects were included along with a 
parametrization of higher twist terms. It was found that the leading twist PDFs were stable 
to variations in the techniques used to calculate the target mass corrections, provided that 
a sufficiently flexible higher twist parametrization was employed. However, it was observed 
that the d PDF was particularly sensitive to the assumptions used to calculate the nuclear 
corrections in deuterium. The purpose of the study reported here is to further investigate 
and quantify the effects of nuclear corrections on PDFs in global flts which make use of 
data obtained with deuterium targets, and to assess their impact on the neutron structure 
function determination, as well as on W production and parton luminosities at the Tevatron 
and the Large Hadron Collider (LHC). 

In Section II we present a detailed discussion of the nuclear corrections which must be 
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accounted for when using deuterium targets, including those arising from the conventional 
nuclear Fermi motion and binding, nucleon off-shell and relativistic effects, and nuclear shad- 
owing at small x. Section III contains a brief summary of the data used, along with the 
fitting procedures, and a detailed discussion of the flavor dependence of the various observ- 
ables considered in this study, with particular emphasis on the large-x region. Section IV 
discusses the results of our fits, which we refer to as "CJ" (CTEQ- Jefferson Lab) fits. Fu- 
ture experimental possibilities to reduce the uncertainty in the d-quark and gluon PDFs are 
outlined in Sec. V, and our conclusions are summarized in Sec. VI. 



II. NUCLEAR EFFECTS IN DEUTERIUM 



Because the deuteron is a weakly bound nucleus, most analyses have assumed that it 
can be treated as a sum of a free proton and neutron. On the other hand, it has long been 
known from experiments on nuclei that a nontrivial x dependence exists for ratios of nuclear 
to deuterium structure functions. While the ratio of the deuteron to isoscalar nucleon 
structure functions has yet to be measured directly due to the absence of free neutron 
targets, most theoretical studies suggest the presence of non-negligible nuclear effects also 
in deuterium, although at a level smaller than for heavy nuclei due to the dcuteron's small 
binding energy, Ed = —2.2 MeV. Particularly in the region x > 0.5 the effects of nuclear 
Fermi motion lead to a rapidly increasing ratio of deuteron to nucleon structure functions, 
Jld/N = F2/F2 , which diverges as x — >■ 1. Any high- precision analysis of the large-x region 
must therefore account for the nuclear effects if data on deuterium (or other nuclei) are used 
in the fit. 

In this section we first review the standard nuclear corrections in the deuteron, including 
the effects of nucleon Fermi motion and binding, formulated within the framework of the 
weak binding approximation (WBA) [8-10]. For most kinematics of relevance to global PDF 
analyses a nonrelativistic description of the deuteron is sufficient; however, at large values 
ol X {x> 0.7) the effects of relativity become increasingly important. We therefore explore 
the sensitivity of the nuclear corrections to relativistic kinematics, as well as to the short- 
distance nucleon-nucleon (NN) interaction parametrized through different nonrelativistic 
and relativistic deuteron wave functions. Finally, the dependence of the structure function 
of the bound nucleon on its virtuality is explored within several models and phenomeno- 
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logical parametrizations and its impact on the R^^^ ratio assessed. Although not affecting 
this analysis significantly, for completeness we also consider nuclear shadowing and meson 
exchange corrections in the deuteron at small values of x. 

A. Standard nuclear corrections 

The conventional approach to describing nuclear structure functions in the intermediate- 
and large-x regions is the nuclear impulse approximation, in which the virtual photon scat- 
ters incoherently from the individual nucleons bound in the nucleus [11, 12]. Early attempts 
[13-17] to compute the nuclear corrections in deuterium, as well as some more recent ex- 
perimental analyses [18-23], utilized simple ansatze to account for the nuclear smearing, 
some of which were rather ad hoc or suffered from various shortcomings (for reviews see e.g. 
Refs. [24, 25]). More recent efforts have focussed on deriving relations between the deuteron 
and free nucleon structure functions more rigorously within well-defined theoretical frame- 
works. 

Within a covariant framework the nuclear structure function can be written as a product 
of the virtual photon - bound nucleon and nucleon-deuteron scattering amplitudes [12, 26], 
which, for x not too close to 1, can be systematically expanded in powers of the bound 
nucleon momentum relative to the nucleon mass, p/M. Keeping terms up to order /M"^, 
the nuclear structure function can be written as a convolution of the bound nucleon structure 
function and the distribution of nucleons in the nucleus [8] . 

Because the struck nucleon is off its mass shell with virtuality — Pq — < M^, its 
structure function can in principle depend on p^, in addition to x and Q^. However, since 
the deuteron binding energy is only 0.1% of its mass and the typical nucleon momentum 
in the deuteron is |p| ~ 130 MeV, the average nucleon virtuality \/p^ will only be ~ 2% 
smaller than the free nucleon mass. One can therefore approximate the bound nucleon 
structure function for x not too close to 1 by its on-shell value, in which case the deuteron 
structure function can be written as a convolution of the free nucleon (proton p or neutron 
n) structure function and a momentum distribution /^v/d of nucleons in the deuteron. 




\d (conv) 





(1) 



The distribution /^v/d (also called the 



a, 



smearing function" ) is in general a function of the 
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deuteron's momentum fraction carried by the struck nucleon, y — {Md/M){p-q/pd-q)i where 
q is the virtual photon momentum, and pa and are the deuteron four-momentum and 
mass. At finite Q^, however, it also depends on the variable 7 = a/1 + Ax^M"^ /Q'^, which 
in the deuteron rest frame can be thought of as the virtual photon "velocity", 7 = \q\/qQ. 
In the Bjorken limit the function becomes independent of 7, but for moderate the 
dependence on 7 becomes important at x > 0.85 [10]. 

The smearing function /;v/d is computed from the deuteron wave function ^ci(p), and 
accounts for the effects of Fermi motion and binding energy of the nucleons in the nucleus, 
as well as kinematic l/Q^ corrections; in the WBA the combined effects of these are given 
by [9, 10] 



where Pz is the longitudinal component of p, and the separation energy e = po — M — 
Ma — Ep — M Ki Ea — /2M by expanding the recoil nucleon energy — sj + p2 
small p. The finite-Q^ correction factor is [9, 10] 

(72 - 1) (2p2 + 3pi 



C(y,7) = 

7 



1 + 



(3) 



y2 2M2 

where p±_ is the transverse nucleon momentum, and becomes unity in the Bjorken limit, 
C{y,^ ^ 1) — > 1. The resulting distribution function is sharply peaked around y ~ 1, with 
the width determined by the amount of binding (in the limit of zero binding, i.e., for a free 
proton or neutron, it would be a 5-function at y = 1). At finite (or 7 7^ 1) the function 
becomes somewhat broader, effectively giving rise to more smearing for larger x or smaller 
Q^. For nonrelativistic kinematics the variable y is bounded by ymin = x{l — 2edM/Q^) and 
Umax — 1 + 7^/2 + £(i/M, so that its maximum allowed value, even in the Bjorken limit, is 
approximately 1.5 instead of M^/M ?s 2. 

The deuteron wave function \l/rf(p) contains the usual nonrelativistic S- and i?-states, and 
is normalized according to J d^'p\'^^{p)f /{2nY' = 1. In the numerical calculations wc use 
several nonrelativistic deuteron wave functions, the high-precision AV18 [27] and CD-Bonn 
[28] A^A^ potentials, as well as the older Paris potential [29] for comparison with earlier work. 
As illustrated in Fig. 1, where we show the computed ratio R^^^ relative to that for the 
AVI 8 wave function, the differences between these are very small for x < 0.7. At larger x the 
differences become more visible, reflecting the greater role played by the large-y tail of the 
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FIG. 1: Dependence of the ratio R^^'^ = F2/F2 on the deuteron wave function, for the nonrela- 
tivistic CD-Bonn [28] (dashed) and Paris [29] (dotted) models, as well as the relativistic WJC-1 
(dot-dashed) and WJC-2 [30] (solid) models, relative to the ratio for the AV18 wave function [27]. 

smearing function, with the CD-Bonn wave function giving an approximately 10% smaller 
ratio for x ~ 0.9. The result with the older Paris wave function agrees with the AV18 to 
within 1% over this x range. The large- ?/ tail in turn reflects greater sensitivity to the short- 
distance parts of the A^A^ interaction, where the validity of the nonrelativistic approximation 
itself is more questionable. For comparison we also show the ratios in Fig. 1 computed with 
the relativistic WJC-1 and WJC-2 wave functions [30] (see Sec. II B below). Because the 
relativistic wave functions contain higher orders in p/M, they exhibit larger high-momentum 
tails, which leads to larger R'^/^ ratios than for nonrelativistic wave functions. Generally, 
the wave functions with the largest D-state probabilities (7.4% for WJC-1 cf. 4.8% for 
CD-Bonn) give rise to the largest distributions at large and hence the highest R^^^ ratios 
at large x. 

B. Relativistic effects 

Since quarks at large momentum fractions x are most likely to originate in nucleons car- 
rying a large momentum fraction y themselves, the effects of relativity will be increasingly 
more important as x — ?■ 1. A relativistic framework is therefore required to describe DIS from 
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FIG. 2: Dependence of the ratio R^^'^ = F^jF^ on the nuclear model and effects of relativity, 
relative to the ratio with the WBA model with the AV18 wave function and relativistic kinematics 
for 7=1. The nonrelativistic WBA model [9, 10] (solid), the MST relativistic convolution [31] 
(dashed), and the light-cone model [32] (dot-dashed) illustrate the nuclear model dependence, with 
the nuclear density extrapolation model [33] (dotted) shown for reference. 

the deuteron at very large x. The effects of relativistic kinematics are generally straight- 
forward to implement, while the dynamical effects of relativity require model dependent 
assumptions. 

Note that for relativistic kinematics the maximum value for y is determined from |/max = 
Md/M+x{M'^—p'^) / since the nucleon virtuality is unbounded from below, at any finite 

the fractional momentum y is effectively unbounded from above. In practice, though, 
taking ?/max ~ M^/M, as would be the case in the Bjorken limit, provides a relatively good 
approximation to the exact integral for values of x < 1. 

The effects of relativistic kinematics on the WBA smearing function (2) are illustrated 
in Fig. 2, where the ratio W^^^ for the smearing function with nonrelativistic kinematics is 
plotted relative to the same ratio using the relativistic expression for y, i.e., not expanding 
the recoil energy Ep. To isolate the kinematic effects on R'^^^ , the same AV18 wave function 
is used in both cases. The relativistic effects are negligible for x < 0.6, but grow to around 
5% at X ~ 0.9, before increasing rapidly as x -t- 1. 

Taking the full spin and Lorentz structure of the off-shell nucleon hadronic tensor into 
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account, a formalism for DIS from bound nucleons which incorporates both relativistic 
kinematics and dynamics was developed in Ref . [26] . This analysis identified the conditions 
under which the nuclear structure function could be expressed in the factorized convolution 
form (1), but found that in general these are not satisfied relativistically. A follow-up 
study [31] (referred to as "MST") showed, however, that one could isolate a "relativistic 
convolution" component from the total F2 , together with additive corrections associated 
with the relativistic P-state components of the deuteron wave function and nucleon off-shell 
corrections. Although the relativistic convolution component of is not unique (only the 
total structure function is), a natural choice which respects baryon number conservation in 
the deuteron (in the 7 — )■ 1 limit) is [31] 

Expanding the integrand in Eq. (4) to order /M'^ one can show explicitly that f^f^^ 
reduces to the nonrelativistic WBA smearing function in Eq. (2) in the 7 — >■ 1 limit. 

In Eq. (4) the relativistic deuteron wave function ^^(p) formally contains, in addition to 
the S- and D-state components, also the singlet and triplet P-state contributions associated 
with negative nucleon energies. To compute the relativistic smearing function we use the 
relativistic deuteron wave functions from Ref. [30] with the WJC-1 model, which has an 
admixture of pseudoscalar and pseudovector nNN couphngs, and the WJC-2 model, which 
has only the pseudovector coupling. The WJC-1 model generally has larger high-momentum 
components than the WJC-2 model, as evidenced in its larger D-state probability, and the 
pseudoscalar nNN coupling leads to significantly larger P-state wave functions than for 
the pure pseudovector model. The harder WJC-1 wave function gives rise to a ratio R^^^ 
which a,t X > 0.6 is larger than that for WJC-2, and as observed in Fig. 1 the larger high- 
momentum tails in both the relativistic wave functions yield larger contributions to Pg (and 
j^d/N^ at large x than in the nonrelativistic models. 

To assess the model dependence associated with the choice of smearing function, in Fig. 2 
we show the ratio P'^/^ computed from the MST smearing function (4) relative to the WBA 
model with relativistic kinematics (2). The differences are < 1% for x < 0.8, rising to 
~ 5% as X — > 1. For comparison we also show the ratio computed using an early light-cone 
prescription for nuclear DIS [32], which has been previously used in some Pg data analyses 
[34]. In this approach the bound nucleons are taken to be on their mass shells, with the 
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struck nucleon energy po — Ep, but with energy not conserved at vertices. The hght-cone 
smearing function (for 7 = 1) is given by [32] 




which preserves baryon number and imphes that the entire momentum of the deuteron is 
carried by the nucleons. Note that for consistency the hght-cone wave function ^'^^^^(p) 
in Eq. (5) should be computed using hght-front dynamics (or in the infinite momentum 
frame), although in practice ordinary rest- frame wave functions are used. Because the light- 
cone prescription does not explicitly incorporate nuclear mesons, which are responsible for 
nuclear binding, it effectively corrects the deuteron structure function for the effects of 
Fermi motion only. This is reflected in the ratio R'^^^ in Fig. 2 being larger at intermediate 
X (0.2 < X < 0.8) than in the models which do account for nuclear binding. Nevertheless, 
the differences between R^/^ computed with the hght-cone smearing function f^J^ and the 
others in Fig. 2 are less than 2-3% for all x. In practice, because of the inconsistency of 
mixing rest-frame wave functions with the light-cone smearing function (5), as well as the 
need for finite-Q^ corrections in the smearing function (which are currently not available for 
the light-cone or MST models), we shall restrict our analysis to the WBA smearing function. 

Finally, we compare the microscopic calculations with a qualitative model in which the 
ratio R'^^^ is extrapolated from data on the ratio of heavy nuclei to deuterium structure 
functions assuming scaling with nuclear density [35] . Using the empirical nuclear density for 
^^Fe and an ansatz for the charge density of deuterium [33, 36], the ratio is obtained from 
jld/N ~ X _|_ (/^Fe/d _ i^^4^ [33]^ which implies similar x dependence of the nuclear corrections 
in ^^Fe and deuterium. Consequently, i?''/^ in the density model has a signiflcantly larger 
depletion at x ~ 0.6 and a rise above unity which does not set in until x ~ 0.8. This is 
significantly higher than typically found in smearing models, as reflected in the rapidly falling 
ratio R^^^ in Fig. 2 for the density model compared with the other calculations [37]. The 
nuclear density scahng is also in disagreement with recent data on fight nuclei [38] which do 
not support simple A-dependent or density-dependent fits to the nuclear structure function 
ratios. Because of this, as well as the difficulties in defining physically meaningful nuclear 
densities for deuterium [36], we do not consider the density model sufficiently quantitative, 
especially at large x, to be refiably employed in global PDF analyses. 
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FIG. 3: Dependence of the ratio R^^'^ = F2/F2 on the off-sheh corrections to the nucleon structure 
function, relative to the on-sheh WBA model. The range of off-shell behaviors includes the MST 
off-shell model [31] (dashed), the Kulagin-Petti model [9] modified for the case of deuterium (mKP, 
solid and shaded band), and the Kulagin-Petti phenomenological fit to large-A structure function 
data [9] (dot-dashed). 

C. Off-shell corrections 



The discussion of nuclear effects above has been under the assumption that the struc- 
ture function of the bound nucleon in the convolution formula (1) does not change inside 
the nucleus, and can be approximated by the on-shell F2 ■ With the inclusion of off-shell 
and relativistic corrections, it was shown in Ref. [31] that the convolution formula receives 
nonfactorizable corrections associated with higher powers of the bound nucleon momentum 
p/M and contributions proportional to — M^, = _|_ The correction 

^(off)^a! ^g^g estimated within a simple quark-spectator model [31], with the parameters of 
the quark-nucleon correlation functions fitted to proton and deuteron F2 data. The overall 
effect is a reduction of about 2% of F2 compared to the on-shell (convolution) approxima- 
tion, as illustrated in Fig. 3. A simple parametrization of 5'^°^''F2 relative to the total F2 in 
the model of Ref. [31] was given in Ref. [1]. 

Within the WBA model, inclusion of explicit off-shell dependence in the bound nucleon 
structure function leads to a generalization of Eq. (1) in which is expressed as a two- 



10 



dimensional convolution in terms of y and the nucleon virtuality [8-10]. For small values 
of — M^l/M^ <^ 1, one can expand the off-shell structure function in a Taylor series 
about p2 = [9], 

(x, Q'y) = Fi'ix, Q') {l + 5h{x, Q') , (6) 

where the coefficient of the — term is given by the derivative of the structure 
function, 



(7) 



Assuming that 5f2 is independent of the nucleus A, a phenomenological fit to all available 
nuclear F^/F^ data, with A ranging from ^He to ^°'''Pb, found for the nonrelativistic model 
[9] 

= Cn{x - 0.05) (x - Xo)(l + xo - x), (8) 

with the best-fit parameters — 8.1(3) and Xq — 0.448(5). The zero crossings of at 
X — 0.05 and x — xq give rise to an enhanced R^^^ ratio for x < 0.45, and a sharp drop 
for X > 0.6 compared with the on-shell result, as Fig. 3 illustrates. This reflects the general 
features of the data on the nuclear EMC ratios R^^'^ = F2 / F^, which the fltted off-shell 
correction mimics [9]. Since one assumes the same shape for the off-shell correction in 
the deutcron as in heavy nuclei, the x dependence of i?'^/^ in this model is similar to that 
of R^/'''. This feature is reminiscent of the nuclear density model [33] (see Sec. JIB) which 
scales the magnitude of R^^'^ to the deuteron, but assumes the same x dependence. While 
this scaling may be reasonable for large-A nuclei, we do not believe that an extrapolation 
all the way to the deuteron is likely to be reliable, especially at large x. In addition, the 
phenomenological fit (8) does not conserve the total baryon number in the nucleus. Some 
cancellation is found between the off-shell and nuclear shadowing corrections [9]. However, 
since shadowing contains important higher twist in addition to leading twist contributions, 
the cancellation cannot be exact at all Q^. 

To avoid these problems we choose instead to use a microscopic model proposed by 
Kulagin & Petti [9] (see also Ref. [8]), but modified to apply to the specific case of the 
deuteron in the valence quark region at large x. Here the valence quark distribution can be 
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expressed through a spectral representation as 

ds / dk^ Dq/r^{s,k'^,x,p'^), (9) 

J — oo 

where is the virtuahty of the quark, with maximum value k^^^ — x{p^ — — x)), and 
s — {p — ky is the mass of the spectator quark system. The spectral function D^/iv can be 
approximated by a factorized form 

D,,N ~ 8{s-s^)^k\K{p^)), (10) 

where the function $ describes the distribution of valence quarks with virtuality k'^ in an 
off-shell nucleon with invariant mass p^, and for simplicity the spectator quark spectrum is 
assumed to be given by a single mass sq- The nucleon off-shell dependence is parametrized 
through the p^ dependence of the large- fc^ cut-off mass parameter A(p^) in Prom a fit 
to the free proton data one finds the value sq = 2.1 GeV^ [9]. The scale A can be related 
to the nucleon confinement radius, A ~ 1/Rn, so that the dependence of A on refiects a 
change in the size of the bound nucleon in a nuclear medium. The off-shell function Sf2 can 
then be written as [9] 

dlOgQy 

5h ^ c+ h{x), (11) 

where 
and 

A=aiogA791ogp2|^,^^, . (13) 

In terms of the confinement radius Rn the parameter A can also be written as A = 
—2{5Rn/Rn){5p'^/M'^), where 5p^ = J dy ip^ - M'^)fN/d{y) is the average nucleon virtu- 
ality in the deuteron. 

In the analysis of Ref . [9] , A and c were chosen to reproduce the shape of the KP fit (8) , 
finding A = 1.03 — a value that corresponds to an increase in the confinement radius of 
SRn/Rn ~ 9%, as may be expected for a heavy nucleus such as ^^Pe. Here we modify the 
model to the specific case of the deuteron, for which 5Rn/Rn ~ 1.5 - 1.8% [39]. Por 5p'^ 
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we take a range of smearing functions computed with different wave functions (AV18, CD- 
Bonn, WJC-1, WJC-2), as well as relativistic versus nonrelativistic kinematics, resulting in 
the range (5p^/M^ Ri —3.6% to —6.5%. This gives A in the range 0.46 - 1.00, with a central 
value of 0.65. Finally, the normalization coefficient c is computed by requiring that the 
off-shell corrections do not modify the valence quark number. 



The resulting ratio R^^^ with the off-shell corrections in this "modified Kulagin- Petti" 
(mKP) model is indicated in Fig. 3 by the shaded band, which reflects the uncertainty 
in the parameter A. For x < 0.5, the ratio with the mKP off-shell correction is similar to 
that with the MST off-shell model, ranging between 1% and 3%, but grows with increasing 
X. At X = 0.9 the off-shell ratio is some 5% - 15% smaller than the on-shell ratio. These 
uncertainties will propagate through the global analysis and affect the precision to which 
PDFs at large x can be determined, as discussed in Sec. IV. Note that in principle the fits 
should be performed using the same deuteron wave function to compute both the smearing 
function and the off-shell correction (namely, for a specific value of Sp^/M"^ rather than the 
above range). However, given the uncertainty in SRn/Rn, as well as the assumptions in- 
herent in the off-shell model itself, we take the conservative approach of including the total 
off-shell uncertainty in the fits for each smearing function. 

D. Nuclear shadowing 

At intermediate and large values of x the nuclear impulse approximation, embodied in 
the convolution approximation of Eq. (1), provides a reasonable basis for computing nuclear 
structure functions. At small values of x, however, this approximation becomes less reliable 
and the probability of scattering from more than one nucleon in the nucleus becomes relevant. 
For the case of the deuteron one can compute the shadowing correction arising from the 
double scattering of the virtual photon from both nucleons using the Glauber expansion in 
the eikonal approximation (for a review see Ref. [40]). 




(14) 



which leads to the constraint 




(15) 
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In the target rest frame one can view nuclear DIS at small x in terms of virtual photon 
fluctuations into qq pairs of mass m^^, which then interact with the target. The propagation 
length of the virtual qq state is M ~ 2z//(Q^ + m^^) with v the virtual photon energy. If 
this exceeds the inter- nucleon separation (~ 2 fm), then the hadronic state can interact 
with different nucleons as it passes through the nucleus. For large v ^ iriqq the propagation 
length becomes ~ 1/Mx > 2 fm, so that shadowing should start to appear at x < 0.1. 

At low the spectrum of qq states can be well approximated by vector mesons a;, 0), 
while at high the interaction is most efficiently described through diffractive scattering 
of the qq pair from partons, parametrized by Pomeron exchange. Such a two-phase model 
was adopted in Refs. [41-44] to compute the shadowing corrections in deuterium and heavy 
nuclei. The vector meson dominance contribution vanishes at sufficiently high Q^, but in 
the ~ few GeV^ range it is in fact responsible for the majority of the variation [42] . 

An additional double scattering contribution arises from the interaction of the virtual 
photon with a mesons (tt, p, . . .) exchanged between the nucleons in the deuteron [42, 45]. 
This correction is mostly positive at small x ( "antishadowing" ) , and partly cancels some of 
the shadowing arising from Pomeron exchange and vector meson dominance [42]. 

In our analysis we parametrize the total deuterium shadowing correction from Ref. [42] 
(similar values were obtained in Refs. [41, 44, 46]) as an additive contribution to the deuteron 
structure function, F2 — >■ F2 + S'^^^^^F2. The resulting correction ranges from about 1.5% 
at X — 10'^ to about 3% for x < 10~^. The effect of such corrections on PDFs in the region 
of interest of the current analysis is negligible. 

III. DATA SELECTION AND FITTING PROCEDURES 

The data sets used in this analysis are the same ones chosen in the recent CTEQ6X 
analysis [1]. Since this represents an extension of the previous work, utilizing the same data 
sets facilitates an easy comparison with the results presented there. The data sets include 
charged lepton DIS on proton and deuteron targets, lepton pair production with a proton 
beam on proton and deuteron targets, Vl^-lepton asymmetry and W asymmetry data sets 
from pp interactions, and 7 + jet and inclusive jet data sets from the Tevatron. As in Ref. [1], 
we adopt for our main fits the cuts > 1.69 GeV^ and W"^ > 3 GeV^. Before proceeding 
with the discussion of the fits, it is useful to briefly review the flavor differentiation at large 
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X provided by the various types of data. 

In the large-x region the relevant quark PDFs are those for the u and d quarks. Consider 
first charged lepton DIS. For large values of x the structure functions are given, to lowest 
order in a^, by 

Fl{x, Q') ^ I (4«(^, Q') + d{x, Q')) (16) 

and 

F^{x, Q') ^ g {u(x, Q') + d(x, Q')) , (17) 

where the superscript N denotes the structure function of an isoscalar nucleon target ob- 
tained from deuterium data by including nuclear corrections. If one had sufficient data of 
both types, then the separate u and d PDFs could be disentangled. The problem, however, 
is that variations in the nuclear corrections cause variations in the extracted which, in 
turn, causes variations in the extracted d PDF. The u PDF is relatively better determined 
since, at large values of x, the d/u ratio is small and the u PDF enters with a weight which 
is four times larger than that for the d PDF in F2 . Thus, the nuclear corrections primarily 
affect the d PDF. 

Next, consider lepton pair production. Since the PDFs enter in a manner different than 
for charged lepton DIS, one may hope to use these data to constrain the d PDF. Using 
lowest order kinematics one has 

Mil — ^1^2 s and xp = xi — X2, (18) 

where Mu is the lepton pair invariant mass, xp — '^Pz/ is the scaled longitudinal momen- 
tum of the pair, and xi and X2 denote the beam and target momentum fractions, respec- 
tively. The region of interest for typical fixed target experiments is large xi and small X2, 
where the proton-proton cross section is (neglecting the dependence) 

a{pp) (X u{x2) (^M^i) + d{xr) ■ (19) 

At small values of 0:2 <^ 1 one has u{x2) ~ d{x2) so that this cross section is approximately 
given by the same linear combination of PDFs as enters , namely, iu{xi) -\- d{xi). This 
situation is not helped significantly even if one considers lepton pair production on a deuteron 
target since for small X2 the nuclear effects discussed previously are small and, using isospin 
invariance, the proton-neutron cross section is 

a{pn) oc d{x2) (mxi) + d{x,) ^p^] . (20) 
V d{x2)/ 
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Thus, one is still sensitive to the same hnear combination of PDFs at large values of xi, as 
probed by available data. 

Next let us examine data for jet production. The cross section for this process involves 
combinations of quark-quark, quark-gluon, and gluon-gluon scattering subprocesses. The 
large- a; gluon PDF is constrained mostly by just such data, assuming that the quark PDFs 
are fixed. Therefore, if the d PDF is varied it is possible for the gluon PDF to compensate 
the change in the jet cross section by a similar but opposite variation in its magnitude. As 
a consequence, we can also expect the gluon PDF to be sensitive to nuclear corrections, and 
anti-correlated to the rf-quark PDF. 

This leaves the data for the W asymm,etry, A]^, and W-lepton asymmetry, Ai. These 
asymmetries arc sensitive to the d/u ratio; however, the data coverage extends at most to 
W rapidities of yw ~ 2.6, which corresponds to x ~ 0.6. The data for Ai extend to lepton 
rapidities of yg ~ 3. This formally corresponds to x ~ 0.8, were it not for the smearing 
induced by the lepton decay vertex, which reduces this to x 0.6 (see Sec. IV D). 

Thus, one can see that to constrain the d PDF in the large-x region using existing data 
one must use DIS data on deuterium, and there one is faced with the necessity of addressing 
the issue of nuclear corrections. In Sec. V below, methods of reducing this dependence on 
the nuclear corrections will be discussed. 

IV. FIT RESULTS 

The global fits discussed here were performed using the same fitting package and tech- 
niques as described in Ref. [1]. We refer to these as the "CJ" (CTEQ- Jefferson Lab) PDF 
fits. The nuclear effects in the deuteron were computed using the smearing function (2) with 
relativistic kinematics. To quantify the systematic uncertainty on the PDFs due to the nu- 
clear corrections, we used the AV18, CD-Bonn, WJC-1 and WJC-2 deuteron wave functions, 
with on-shell nucleons or off-shell corrections from the range allowed in the mKP model, as 
discussed in Sec. II. We also considered a generalization of the d-quark parametrization at 
the initial scale Ql = 1.69 GeV^ that allows the d/u ratio to have an arbitrary value in the 
a; — )■ 1 limit, as opposed to either or infinity as with standard parametrizations used in 
global fits. 



16 




FIG. 4: Deuterium to isoscalar nucleon ratio R^^^ = F2 /F,^ using the convolution approximation 
in Eq. (1) for a "toy model" input ~ (1 — x)^, without off-shell corrections (dashed) and with 
the mKP off-shell corrections (11) included (solid and shaded band). For comparison the ratio 
with the nuclear density extrapolation model (dotted) is also shown. 

A. Impact of nuclear corrections 

To understand the impact of the nuclear smearing and off-shell corrections, it is useful 
to examine the ratio of F2 on a deuterium target to that on an idealized isoscalar target 

= p+n. To simplify the comparison we take a "toy model" for the input nucleon structure 
function, ~ (1 — x)^, and compute the deuteron structure function from the convolution 
approximation in Eq. (1) with the smearing function calculated using the AV18 deuteron 
wave function. The resulting ratio W^^^ is illustrated in Fig. 4 (dashed curve), and compared 
with the ratio computed including nucleon off-shell corrections from the mKP model (solid 
and shaded band), and, for reference, also with the nuclear density extrapolation model 
(dotted). As anticipated from Fig. 3, the effect of the off-shell corrections is a further 
suppression of R'^^'^ over most of the range of x, with the ratio now rising above unity 
for X > 0.7. The dip in the W^^^ ratio in the density model is more pronounced than in 
the smearing models, even compared with the largest off-shell corrections, and has a shape 
that essentially follows that of ratios of structure functions of heavy nuclei, such as ^^Fe, to 
deuterium. 
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Qualitatively, implementing the nuclear corrections is equivalent to dividing the deu- 
terium data by R^/^ , thereby yielding results for an isoscalar target. In actuahty, the cor- 
rections involve convolutions as well as realistic inputs instead of the simple ~ (1 — xY 
form we use for illustration. However, the impact of the nuclear corrections on fits of the 
d-quark distribution can be understood from the following simple argument, which does not 
depend on these details. When F2/F2 is less than one, the removal of the nuclear effects 
increases the data and therefore yields a larger d PDF than one would have had if the nuclear 
effects were ignored. Conversely, if the ratio is greater than one, the fitted d PDF will be 
reduced relative to that extracted from uncorrected data. From the ratio in Fig. 4 one can 
see that there will be an increase in the d PDF for x < 0.6 — 0.7 (depending on the size of 
the off-shell corrections), while the Fermi smearing rise in the ratio leads to a decrease in 
the fitted d PDF for x > 0.6 - 0.7. 

B. d/u ratio 

Since the d-quark PDF is most affected by variations in the nuclear corrections, with the 
u quark relatively unchanged, the results of the fits are best summarized by examining the 
ratio of (i to -u PDFs. This ratio is also of theoretical interest as its limiting value as x — )■ 1 is 
a sensitive indicator of nonperturbative quark-gluon dynamics in the nucleon [47, 48]. The 
effect of nuclear corrections on the gluon PDFs will be discussed in Sec. IV E. 

In Fig. 5 the d/u ratio is shown for various models of nuclear corrections. The left 
and middle panels show the variation induced by the choice of nucleon off-shell corrections 
with a fixed (AVIS) deuteron wave function. As the magnitude of the off-shell corrections 
increases, the d/u ratio correspondingly rises. The left panel is obtained using a conventional 
parametrization for the d PDF which behaves as a power of (1 1 [1, 2]. The 

middle panel shows the results of employing a modified form of the d PDF parametrization 
at the input scale Qq, 



with a and h fitted parameters (see also Refs. [35, 49]). With the conventional parametriza- 
tions of the d and u PDFs the d/u ratio goes either to zero or infinity as a; — >■ 1; with the 
modified form the ratio d/ii — > a as x — > 1. The fits, depending on the nuclear correction 
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FIG. 5: The d/u ratio at = 10 GeV^ obtained with different choices of d-quark parametrization 
and nuclear corrections: {left panel) standard d-quark parametrization [1] and AV18 deuteron wave 
function for several off-shell corrections models; (middle panel) modified d-quark parametrization 
(21) for the same nuclear corrections; (right panel) dependence on the deuteron wave function for 
a fixed off-shell correction (mKP). The shaded bands and bottom panels show the Ax = 1 PDF 
errors and the relative PDF errors on the d/u ratio, respectively. The thinner lines above x ~ 0.8 
denote extrapolations into unmeasured regions. 

used, show a slight to marked preference for a nonzero value of a, with its value increasing 
as the off-shell corrections become stronger. The error bands are larger for the case of the 
modified parametrization since the ratio is not required to go to zero at a; = 1, so one has a 
wider range of possibilities for the ratio in the region beyond x = 0.8. Note, however, that 
the relative errors are essentially the same for both parametrizations. 

Variations in the choice of the deuteron wave function also affect the extracted d PDF, as 
shown in the right panel of Fig. 5, where the modified d parametrization (21) is utilized. The 
resulting spread is comparable in magnitude to that observed for variations in the nucleon 
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FIG. 6: Full range of variations of the d/u ratio due to deuteron wave function and off-shell 
corrections (cross-hashed band) and PDF fit uncertainties (diagonal hashed). The arrow indicates 
the extrapolation region at large x. 

off-shell correction models. 

Note that the d/u results above x ~ 0.8, marked by thinner lines in Fig. 5 for the central 
values, represent extrapolations of the fitted PDFs and are not constrained directly by data. 
Correspondingly, the fit errors increase significantly for x > 0.8, especially for the PDFs 
using the less restrictive d parametrization (21), as noted above. In all the fits the error 
bands are calculated using the Hessian technique and correspond to Ax = 1, which was 
chosen for clarity of the presentation. A larger choice of Ax would have made it difficult to 
display the effects of variations in the nuclear models. 

Figure 6 shows the total variation in the d/u ratio (cross-hashed band) for all the choices 
of deuteron wave function and off-shell corrections considered in this analysis, using the 
modified d-quaik parametrization (21). The highest rf-quark PDF at large x is obtained 
with the combination of CD-Bonn wave function and the upper limit of the mKP off-shell 
correction, while the lowest d quark is obtained with the WJC-1 wave function and no 
off-shell corrections. The outer band in Fig. 6 (diagonal hashed) shows in addition the 
uncertainty from the PDF fit. As a reference fit for later use we choose the PDFs extracted 
using the AV18 and mKP (central) off-shell corrections, which produces a d-qnaik PDF 
approximately midway between the upper and lower limits. Above x ~ 0.8 the PDFs are 
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not directly constrained by data and represent extrapolations, as indicated by the arrow in 
Fig. 6. 

It is clear that the systematic uncertainty on the extracted d PDF due to the model de- 
pendence of the deuteron corrections is considerable. One of the main interests of measuring 
the d/u ratio at large x is that its limit as a; — )■ 1 is sensitive to the nonpcrturbativc struc- 
ture of the nucleon [47, 48]. However, given the large nuclear model uncertainty, essentially 
any value of d/u between and ~ 0.5 is currently allowed, which encompasses most values 
predicted by the nonperturbative models [47]. 

The nuclear model uncertainty on the d/u ratio is several times larger than the Ax — 1 
PDF uncertainty induced by the experimental errors. Other global fits, such as MSTW08 
[3] or CTIO [2], utilize a different criterion to quote the PDF experimental uncertainty, 
which can be approximately reproduced using Ax = 7. In this case the PDF nuclear 
uncertainty would be slightly smaller than the experimental uncertainty. Overall, the nuclear 
and experimental uncertainties are of the same order of magnitude. Progress in constraining 
the d-quark PDFs at large x will therefore require either a better understanding of nuclear 
corrections to reduce the nuclear corrections systematics, or the use of a much larger free- 
nucleon data set sensitive to large-a; d quarks than presently available. 

C. Fits of neutron structure functions 

The PDFs obtained from our global fit can be used to compute the structure functions 
of a free neutron, providing an effective method to extract these from inclusive deuteron 
DIS data, which additionally benefits from information coming from a much wider range of 
processes. In Fig. 7 we show the F^/Fl* ratio calculated including the full range of nuclear 
variations considered in our fits. As in Fig. 6, the cross-hashed band illustrates the variation 
due to the choices made for the deuteron wave function and the off-shell corrections, while 
the hashed band shows in addition the uncertainty coming from the PDF errors. The arrow 
serves as a reminder that the bands at x > 0.8 are an extrapolation unconstrained by data. 
Note that as a consequence of the positivity of the u and d quark distributions, / must 
be greater than 1/4 for all values of x. (The PDF error band can exceed that limit because 
it was obtained with the Hessian technique, which assumes Gaussian errors also close to the 
limits of the PDF functional space.) 
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FIG. 7: As in Fig. 6, but for the FJ^jFl ratio. For comparison, the central value of a fit with no 
nuclear corrections applied is also shown (dashed). 

For comparison we also show in Fig. 7 the F^jFl^ ratio extracted assuming no nuclear 
corrections in deuterium. Above x ~ 0.75 the neutron structure function extracted under 
this assumption is significantly larger than the range allowed by our fits. (In fact, assuming 
F^ = F2—F2 implies a divergent experimental F2 jF^ ratio at a; = 1, where F^ vanishes but 
F^ remains finite.) The nuclear uncertainty in the ratio in Fig. 7 is thus somewhat 

smaller at large x than would be the case if we had considered the entire range of possibilities 
for the nuclear effects, including the "no nuclear corrections" ansatz. Nonetheless, given the 
choice of the range of nuclear corrections discussed in Sec. II, the F^/Ff ratio at x ~ 1 
appears consistent with any value between 1/4 and ~ 0.7, covering all nonperturbative 
model predictions [47, 48]. 

A recent effort to extract the neutron structure function from inclusive deuterium data 
[50, 51] follows a complementary approach to that adopted here. Firstly, phenomeno logical 
fits to proton and deuteron structure functions are used to interpolate available data to a 
fixed value of Q^. The data are then analyzed within the convolution framework, using a 
range of nuclear corrections selected from the literature [51]. The resulting size of uncer- 
tainties on the F2/F2 ratio in Ref. [51] is smaller than but comparable to that in Fig. 7. 
However, the central value was found to decrease monotonically with x and shows no sign 
of a plateau. This is in contrast to our findings, which suggest a possible plateau or even 
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increase in F2/F2 for a; > 0.7 — 0.8. An important difference between our analysis and that 
of Refs. [50, 51] is that our F2/F2 ratio is determined from global fits of leading twist PDFs 
(which are constrained to be positive definite), utilizing a wide array of data, not restricted 
to hydrogen and deuterium DIS only. The data analyzed in [50, 51], on the other hand, are 
at the structure function level, including possible higher twist and other subleading 
corrections, so that F^/F^ is not constrained to lie above 1/4. 

It is worth reiterating that there is a wealth of data in the region up to x = 0.98 over a 
large range of photon virtualities, 7 < < 31 GeV^, which most previous global fits have 
explicitly not utilized in order to avoid nonperturbative effects such as nucleon resonances, 
target mass and higher twist corrections. Our fits include a sizable fraction of these data, 
delineated by the cuts Q"^ > 1.69 GeV^ and W'^ > 3 GeV^, but stiU avoid most of the 
resonance region data, and are limited to x ~ 0.8 as indicated by the arrow in Fig. 7. These 
resonance region data are, however, included in the analysis of Ref . [50] , and may account 
for some differences in the final results as compared to this work. Studies of the resonance 
region data using the standard nuclear corrections demonstrate agreement between the DIS 
data sets and averages over resonance region data [52-54], and future efforts will explore 
ways to incorporate the latter within a global PDF framework. 

D. No-deuteron fits and the W asymmetry 

One of our motivations for including the deuterium data in the global fit was to reduce 
the uncertainty on the d PDF in the large-x region. The results presented thus far indicate 
that variations in the choices for the nuclear models can be largely compensated by changes 
in the d PDF. The question thus naturally arises as to whether or not the addition of the 
deuterium data has helped reduce the error on the d PDF at all. This can be examined 
by comparing in Fig. 8 the results of two fits with and without the deuterium data; for the 
former, results are shown for our reference fit using nucleon smearing in the deuteron with 
the AV18 wave function and the mKP off-shell corrections. 

Two things stand out in this figure. First, for a specific choice of the nuclear corrections, 
the d uncertainty is significantly reduced, as would be expected. Thus, if a method could 
be devised which would narrow the options for the choice of the nuclear models, then the 
deuterium data would provide the information that is needed to constrain the d-quark PDF. 
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FIG. 8: d/u ratios extracted from our reference fit, using the nucleon smearing function in the 
deuteron with the AV18 wave function and the mKP (central) off-sheh corrections. The sohd 
curve utihzes all the data (with uncertainties given by the cross-hashed band); the dashed curve 
(with uncertainties given by the diagonal-hashed band) results from removing all the deuterium 
DIS data sets, avoiding the need for deuteron corrections. The PDF error bands correspond to 
Ax = 1. The arrows indicate the extrapolation region of each fit [x > 0.6 for the no-deuteron fit, 
and X > 0.8 for the full fit). The relative error of the two fits is shown in the bottom panel. 

On the contrary, in the absence of constraints on the nuclear models, the systematic nuclear 
uncertainty is comparable to the PDF uncertainty in the no-deuteron fit. Secondly, the 
removal of the deuterium data results in a significant increase in the d/u ratio at low values 
of X, where the nuclear smearing and off-shell corrections to deuteron data are negligible. 
This was rather unexpected and suggests that the deuterium experiments are pulling against 
some combination of other data in this region. One might expect that in this region of 
low X nuclear shadowing corrections might be important; however, examination of various 
shadowing models suggests a correction of only a percent or so, insufficient to explain the 
shift in Fig. 8. 
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FIG. 9: (Left) The CDF W asymmetry data compared to the calculation using fits with deuteron 
data and nuclear corrections (shaded band) and the "no deuteron DIS" fit (dashed curve). (Right) 
The D0 W decay electron asymmetry data integrated over pT > 25 GeV compared to the same 



Inspection of the proton DIS data sets shows that the improvement between the 
reference and no-deuteron fits is slight and is not confined to any specific region of x; this is 
true also for the lepton pair and jet data. However, the CDF W asymmetry data, shown in 
Fig. 9 (left panel), suggest a marked decrease in of about 20 units at W rapidity yw < 1-9. 
In contrast, for yiy > 1.9 the nuclear uncertainties become non- negligible, making it more 
difficult to quantify the improvement of the no-deuteron fit, which is strongly dependent 
on the reference nuclear model. These observations can be understood by noting that the 
W asymmetry is approximately given by 



where Xi,2 = {Mw / y/s) ^wi^Vw), and M^* is the four-momentum squared of the virtual 
W boson. Equation (22) is a simple leading-order result in which the effects of the sea 
quarks are neglected and the antiquark distributions in the antiproton have been set equal 
to the quark distributions in the proton. It is clear that, at positive rapidity, increasing d/u 
at small x or decreasing d/u aX large x results in an upward shift of Ay/. 

The interplay between xi and X2 becomes clear from Fig. 10, which shows Xi^2 versus 
y\Y at CDF kinematics (left panel), and the fitted the d/u ratios on a logarithmic x scale 
(right panel) for the minimum and maximum nuclear corrections, and the no-deuteron fit. 
Data at yw > 2 are sensitive to PDFs at Xi > 0.3, where the nuclear correction uncertainties 
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FIG. 10: (Left) Values of xi.2 versus W rapidity yw at CDF W kinematics. (Right) d/u ratios on 
a logarithmic x scale for the "no deuteron DIS" fit (long dashed), and for maximum (solid) and 
minimum (short-dashed) nuclear corrections. 

become increasingly large, and at X2 < 0.005, where the difference between the full and no- 
deuteron fits becomes small and the d/u ratio is well constrained. Laige-yw data are thus 
mostly sensitive to large-x partons, and are affected by nuclear uncertainties. Data at yw < 2 
are sensitive to intermediate values of Xi^2 £ (0.01,0.1), precisely where the deuteron and 
no-deuteron fits differ maximally. It is clear that the improvement in at small yw is 
entirely due to this region, where the deuteron DIS data and CDF W asymmetry data pull 
against each other. 

The D0 collaboration data on the asymmetry of the electrons originating from the 
W decay, integrated over electron transverse momenta pt > 25 GeV, is shown in the right 
panel of Fig. 9: the small decrease in observed at small lepton rapidity ye is compensated 
by a small or large increase at large rapidity, depending on the reference nuclear correction. 
A similar behavior is observed in the D0 muon asymmetry data. Thus, in our fits, the 
lepton asymmetry data seem to pull only very slightly against the deuteron target DIS data. 
In this respect, one should note that due to the nature of the vector-axial vector decay of 
the W, the lepton asymmetry is less sensitive to the d/u ratio than is the case for the W 
asymmetry itself. 

The d/u ratio in the no-deuterium fit is significantly steepened in the low-x region, as 
shown in Fig. 8. This results in an increase in the W asymmetry with a corresponding 
decrease in x^- ^in effort to understand the source of this behavior, the fitted PDFs 
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were used to compare the theoretical predictions with the deuterium data which had been 
removed from the fit. The SLAC and BCDMS data start ai, x — 0.07, the Jefferson Lab 
data at X — 0.40, and the NMC deuterium/proton ratio data at x = 0.007. The NMC ratio 
data have the most overlap with the relevant x region for the low rapidity W asymmetry 
and, therefore, they showed the biggest increase in when the reference and no-deuterium 
results were compared. Recently, Alekhin et al. [55] have questioned the value of the ratio 
of the longitudinal and transverse cross sections, used to extract the structure functions 
from the NMC cross section data. However, as noted in Ref . [56] , the structure function ratio 
is relatively insensitive to R. One might also worry about the choice of parametrization for 
the PDFs. However, it is clear that the W asymmetry data prefer a somewhat steeper d/u 
ratio at small x than do the deuterium data and no change in the form of the parametrization 
could alter the resulting tension since it occurs in a common range of x. 

In summary, these plots and the inspection show that the CDF W asymmetry data 
and the D0 lepton asymmetry data seem collectively to prefer a somewhat larger d/u ratio 
than is favored by the deuterium DIS data in the region below x < 0.2. Moreover, as noted 
previously, at larger x the lepton asymmetry data prefer a somewhat larger ratio than the 
CDF W asymmetry. This unfortunately prevents them being used to constrain the nuclear 
corrections, to which they are mildly sensitive. The origin and possible resolutions of the 
tension between the W and lepton asymmetries, as well as that between these asymmetries 
and the deuteron data, is being actively discussed [2, 3, 57-59]. 

E. Parton luminosities at hadron colliders 

To discuss the effects of the nuclear model dependence on different PDFs, the extremes 
of the d PDF are shown in Fig. 11 divided by the reference fit PDF, along with the corre- 
sponding li-quark and gluon PDFs. Even though the d PDF at large values of x varies by 
as much as 60%, the u PDF shows httle variation. This is because the DIS and lepton 
pair data at large x are for the most part sensitive to the combination A.u -\- d, and the d/u 
ratio is on the order of 0.1 in this region. Thus, the role played by the u PDF here is about 
40 times more important that of the d PDF. Therefore, a small and anticorrelated shift in 
the u PDF can compensate for a large change in the d PDF. 

The situation is somewhat different for the case of high-p^ jet production. This process 



27 



1.6 
1.2 
0.8 
0.4 

0.5 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

■ 


M 1 1 1 1 1 M 1 1 1 M 1 1 M 1 1 1 1 /I 

rd/d,,, /- 


M 1 1 1 M 1 1 1 1 1 M 1 M 1 1 1 M/l 1 

- S/ Sref ' 
/ 

/ 

✓ 


I r 1 ; I X nuclear 
min nuclear 

J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1. 


s 

— \ 

\ 

\ 

\ 

- 1 1 1 \ - 


J M 1 M M M 1 1 1 M M M M. 


: (5u/u : 

~ ' ' ' 1 ' ' 1 1 1 1 1 1 1 1 1 1 1 1 1 1 rTf 


INI MM MM 

: (5d/d ^' : 

— / — 
: / : 

"' ' ' ' M 1 n M 1 M M liVfl'l^r 


: ^^^^ /: 
"i 1 1 1 M 1 iT+MM M 1 1 M 1 1 1 r 



0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 

X 



FIG. 11: The extremes of the variations of the u (left panel), d (middle panel), and gluon (right 
panel) PDFs, relative to reference PDFs extracted using the smearing function with the AV18 
deuteron wave function and mKP (central) off-shell corrections (11). 

receives contributions from qq, qg, and gg scattering subprocesses with the qq subprocesses 
dominating at large values of x for the target. Since the u PDF is already well constrained, 
a variation in the d PDF is compensated by a significant and anti-correlated shift in the 
gluon PDF. The sea quarks are largely unconstrained at large x, and follow the behavior of 
the gluon, that generates them radiatively; the only exception is the d quark, which seems 
to follow the d quark at0.2<x<0.6 and the gluon only at x > 0.6. 

The large nuclear uncertainty in the quark PDFs, and even more in the gluon PDF, 
which shows a significant variation at values of x as low as 0.4, have potentially profound 
implications for future collider experiments. As an illustration, we consider the parton 
luminosities 
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where s (s) is the hadronic (partonic) center of mass energy squared, and fi{x,Q^) is the 
PDF for a parton of flavor i (quark or gluon) at fractional momentum x and scale Q^. In 
Fig. 12, the parton luminosities are plotted at -y/i = 7 TeV relevant to the current LHC 
runs. The gg luminosity controls the main channel for Higgs production, the gd luminosity 
is relevant for jet production, and the du luminosity controls the "standard candle" cross 
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FIG. 12: Parton luminosities at ^/s = 7 TeV at the LHC. Shown are the extremes of the variations 
due to deuterium nucleon corrections of the gg (left), gd (middle) and du (right) luminosities 
relative to reference PDFs extracted using our default nucleon smearing function in the deuteron 
with the mKP (central) off-shell corrections (11). 
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FIG. 13: Differential parton luminosities for gg (left), gd (middle) and du (right) at fixed rapidity 
y = 1, 2 and 3, as a function of r = ^ s/s, illustrating the variations due to the choice of deuterium 
nucleon corrections. 

section for W~ production. Figure 12, shows that the theoretical uncertainty induced by 
nuclear corrections extends to rather small scales y/S, and grows quickly above 5-10% as \/I 
exceeds 1 TeV. 

The luminosity ratios in Fig. 12 are relevant for the total cross section for producing an 
object with a mass equal to V^- However, one can also investigate a specific region of x by 
considering the production of an object of the same mass, but at a specified rapidity y (not 
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to be confused with the momentum fraction y carried by a nucleon in the deuteron, as in 
Sec. II). The differential parton luminosity is then given by 



where Xi^2 — tc^^, with r = \Js/s. The differential luminosities, normalized to calculations 

using our reference fit, are shown in Fig. 13 as a function of r for three values of rapidity 
(since the ratios are largely independent of Q^, these plots are also independent of s). The 
parton luminosities not plotted there show a qualitatively similar behavior. Going to larger 
rapidities increases the sensitivity to the large- a; region for a given mass. Therefore, the 
nuclear corrections come into play for smaller masses as the rapidity is increased. For 
example, for W production at the Tevatron the nuclear uncertainty becomes relevant at 
rapidity larger than 2, as already observed in Sec. IV D. 

V. DISCUSSION 

In the previous sections we have discussed in detail the fundamental problem with using 
deuterium DIS data to constrain the (i-quark PDF at large x, namely, the deuterium data 
only constrain the combination of the d PDF and the model used to compute the nuclear 
corrections. Variations in the nuclear models are easily compensated by changes in the 
d distribution, while the u distribution is already well constrained by the hydrogen DIS 
data and other observables. It is also interesting to note that the resulting variations in 
the d PDF feed over to accompanying anticorrelated variations of the gluon PDF. In order 
to disentangle the d quark from the nuclear correction model, one either needs additional 
experimental observables which do not rely on nuclear targets, or experiments on nuclear 
targets for which it can be arranged that the results are less sensitive to nuclear corrections. 

A. Constraints on large-x d quarks 

One approach to directly constrain c^-quark PDFs, avoiding nuclear corrections altogether, 
would be to utilize data for vp DIS, which at large values of x is dominated by the d PDF. 
While such data exist [60], they were obtained using a hydrogen bubble chamber and are 
statistically limited at large values of x. High statistics data for this process from new 
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high intensity neutrino beam hnes, such as NUMI at Fermilab, would provide a valuable 
constraint. In addition, data on the ratio of vp to Vp cross sections from a single experiment 
would provide direct sensitivity to d/u at large x, while allowing for a partial cancellation 
in the uncertainties associated with beam flux. 

A similar source of information on the d PDF would be DIS data for charged current e^p 
interactions [61]. These data were taken as part of the HERA experimental program, and 
will help constrain the d-quark PDF at small x, providing additional information concerning 
the observed tension between the deuterium DIS data and the CDF W asymmetry data. At 
large x they do not provide strong enough constraints due to limited statistics, even though a 
novel analysis is underway to extend their reach to the largest possible x [62] . Nevertheless, 
their inclusion in a full global fit should provide additional constraints on the d/u ratio. 

Another observable sensitive to d/w is the Z rapidity distribution, since the weak coupling 
of the Z boson to the quarks makes it sensitive to a different combination of u and d quarks 
from the Drell-Yan lepton pair data. The cross section for Z-boson production involves a 
different linear combination of parton luminosities than either W production or lepton pair 
production, thereby offering yet another constraint on the d/u ratio. We intend, in a future 
study, to investigate this observable and include in our fits the available data, which currently 
have a limited statistical significance at large x. A better experimental understanding of the 
W asymmetry and 1^-lepton asymmetry data is also needed, since at large rapidity these 
data could help discriminate nuclear correction models. However, since they pull the d/u 
ratio in opposite directions, they cannot yet be used to this purpose. 

At lower energies, a number of experiments have been proposed at Jefferson Lab which 
seek to reduce or eliminate the need for the nuclear corrections that limit current extractions 
of F^, and the next 5 years promises to significantly improve the experimental situation with 
regards to the d PDF at x > 0.65. This will be provided through both flavor sensitive probes 
in electron-proton scattering and electron scattering from effective neutron targets. Several 
of these experiments have already been approved for running at Jefferson Lab after the 
upgrade of the accelerator to 12 GeV, allowing for larger Q/^ and x to be probed for W"^ 
outside the nucleon resonance region. 

One such method proposed for minimizing the nuclear uncertainties in is the 
"MARATHON" [63] experiment, which aims to extract F^/F^ from a measurement of 
the F2 structure functions of tritium and ^He. The use of these mirror nuclei allows for 
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the cancellation of nuclear effects to the ^ 1% level [64] , as well as many of the systematic 
uncertainties. 

A complementary method which has recently been successfully implemented in the 
"BONUS" [65] experiment in Jefferson Lab Hall B is based on the spectator tagging tech- 
nique. Here, low momentum protons tagged in the backward '~fd center-of-mass hemisphere 
in the scattering of electrons from a deuteron target ensures that the scattering took place 
off a nearly free neutron. The effects of final state interactions of the neutron debris with the 
spectator protons and nucleon off-shell corrections are minimized by restricting the protons 
to momenta to below 100 MeV/c and angles above 120°. Utilizing the measured momentum 
vector of the proton and the virtual photon, the initial momentum of the target neutron can 
be effectively determined, allowing for the kinematic correction of Fermi motion effects. The 
results from the BONUS experiment with a maximum beam energy of 5 GeV are limited to 
a maximum x of about 0.65; however, the power of the technique has been demonstrated 
and an extension to x 0.85 and higher Q"^ has been approved for running at the 12 GeV 
energy [66]. 

Another novel, flavor-sensitive observable is provided by the parity- violating DIS asym- 
metry (resulting from the interference of 7* and Z-boson exchange) on a hydrogen target 
[67], which yields a new combination of u and d PDFs at large x, and is free of any nuclear 
corrections. A 6 GeV run has been completed and subsequent runs are proposed for the 
12 GeV era, over a range extending to x fa 0.8 [68]. 

B. Constraints on large-a; gluons 

As discussed in Sec. IV, due to the interplay of deuteron DIS data and laige-px jet data 
the gluon PDF is also very sensitive to nuclear corrections. Therefore these can also be 
constrained by considering independent observables sensitive to large x gluons. Two such 
observables are the longitudinal DIS structure function Fl, and the charm structure function 
F2 obtained by requiring at least one charmed particle in the final state of inclusive DIS. 

The disadvantage of the structure function is that the gluon PDF typically decreases 
at large x much faster than the u and d quarks, which dominate this region. Therefore, in 
practice, the sensitivity of Fl to the gluons decreases fast as x approaches 1. The charm 
structure function is more directly related to the gluon distribution, but variations in the 
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heavy-quark scheme used in the fit may in fact accommodate a number of possible gluon 
behaviors, as has been observed in recent fits of structure functions at HERA [69]. It would 
nonetheless be worthwhile to include Fl and data in future fits to explore their power in 
constraining large-x gluons. 

C. Constraints on nuclear corrections 

The future free nucleon data discussed in Section V A open the possibility of no-deuteron 
fits with largely reduced d-quark PDF errors compared to that in Fig. 8, which uses currently 
available data sets. Having accurately determined the d-quark PDF at large x one can then 
confront the validity of various nuclear correction models directly. For instance, the free 
neutron F2 structure functions can be calculated from the fitted PDFs, and used in the 
computation of the inclusive deuteron structure function. Any residual differences compared 
to data would then refiect directly the choice of nuclear model. Likewise, Drell-Yan pd data 
at negative rapidity, which may be reached in collider experiments such as at RHIC, could 
be used to further constrain the nuclear corrections in deuterium. 

Global QCD fits, however, provide an even more powerful method of constraining the 
deuterium corrections, since they allow one to use a more diverse set of data, and to take full 
advantage of the statistical power of all the available data. For example, one could perform 
a global fit additionally including data sensitive to gluons, such as described in Section VB, 
and the deuteron DIS data themselves. For each given choice of nuclear corrections the fit 
would find PDFs, with very small PDF errors, that try to describe as well as possible both 
the deuteron data and the free nucleon data. A nuclear model that would, say, overcorrect 
the deuteron structure function would then produce PDFs poorly describing cither data set. 
The phenomenologically viable nuclear corrections can then be selected among those that 
minimize the tensions between these data. 

Such tests of nuclear models of the deuteron within global PDF fits will be complementary 
to direct experimental tests which will be made possible by the future data, and are an 
important prerequisite for an understanding of heavier nuclei. 
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VI. CONCLUSIONS 



Deep inelastic scattering and lepton pair data taken on proton targets are generally 
sensitive to the combination 4u + d at large values of x. An additional linear combination 
is required in order to reliably separate the u and d PDFs. Traditionally, this role has been 
played by DIS data taken on a deuterium target. In this analysis we have investigated 
the uncertainties induced in the d PDF by variations in the choices made for modeling the 
deuterium nuclear corrections. We have shown that reasonable choices for the deuteron wave 
function and for modehng the off-shell effects result in large uncertainties in the d PDF as 
x — > 1. As expected, the use of deuterium data reduces the uncertainty on the d PDF 
for any given set of choices for the nuclear models. However, the totahty of the induced 
uncertainties for all of the models is large. 

The u PDF remains well determined despite the large uncertainties in the d PDF. This 
is due to the fact that the large-x DIS proton data and the large xp lepton pair production 
data on both proton or deuteron targets are sensitive to Au + d, and at large values of x 
the ratio d/u C(O.l) so that the role played by the d PDF there is down by a factor of 
~ 40 relative to that of the u PDF. Thus, a small change in the u PDF can compensate a 
large change in the d PDF. On the other hand, the d and gluon PDFs are anticorrelated by 
the high-p2- jet data. The large uncertainties of the d PDF lead to large uncertainties of the 
gluon PDF which extend down to a; ~ 0.4. This has potentially profound implications for 
future collider experiments. 

This situation may be improved when data which are sensitive to the d/u ratio or gluons 
at large x, but which are not sensitive to nuclear corrections, become available. Several 
such experiments are planned and should take data in the near future. Inclusion of these 
data in global QCD fits will help constrain the phenomenologically viable range of nuclear 
correction models, thereby reducing the currently large uncertainties in the d-quark and 
gluon distributions. 
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